Optical Character Recognition, Using K-Nearest Neighbors

Author

  • Wei Wang
Abstract

Optical character recognition (OCR) has been widely discussed in the literature. Given a handwritten text, an OCR program aims to recognize the text. Although several approaches to this problem exist, it remains open. In this paper we propose an approach based on the K-nearest neighbors algorithm that achieves an accuracy of more than 90%. Training and run times are also very short.
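
As a rough illustration of the K-nearest neighbors idea named in the abstract, the sketch below classifies a small flattened bitmap by majority vote among its k closest training samples. It is not the paper's implementation: the 3x3 toy glyphs, the Euclidean distance, k = 3, and the names `euclidean`, `knn_predict`, and `TRAIN` are all assumptions made for this example.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train, query, k=3):
    """Return the majority label among the k training samples nearest to query.

    train is a list of (feature_vector, label) pairs (hypothetical layout).
    """
    nearest = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy training set (assumed data): 3x3 binary glyphs flattened row by row.
TRAIN = [
    ((0, 1, 0, 0, 1, 0, 0, 1, 0), "I"),
    ((0, 1, 0, 0, 1, 0, 0, 1, 1), "I"),
    ((1, 1, 0, 0, 1, 0, 0, 1, 0), "I"),
    ((1, 0, 0, 1, 0, 0, 1, 1, 1), "L"),
    ((1, 0, 0, 1, 0, 0, 1, 1, 0), "L"),
    ((1, 0, 0, 1, 1, 0, 1, 1, 1), "L"),
]

if __name__ == "__main__":
    noisy_i = (0, 1, 0, 0, 1, 0, 0, 0, 0)  # an "I" with one pixel missing
    noisy_l = (1, 0, 0, 1, 0, 0, 0, 1, 1)  # an "L" with one pixel missing
    print(knn_predict(TRAIN, noisy_i))
    print(knn_predict(TRAIN, noisy_l))
```

There is no training step beyond storing the labeled samples, so all of the work happens at query time, when the query is compared against every stored sample. A real handwritten-character system would use larger images and possibly other features or distance measures.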

Similar resources

The design of a nearest-neighbor classifier and its use for Japanese character recognition

The nearest neighbor (NN) approach is a powerful nonparametric technique for pattern classification tasks. In this paper, algorithms for prototype reduction, hierarchical prototype organization and fast NN search are described. To remove redundant category prototypes and to avoid redundant comparisons, the algorithms exploit geometrical information of a given prototype set which is represented a...

Recognition of Handwritten Persian/Arabic Numerals Based on Robust Feature Set and K-NN Classifier

Persian handwritten numerals recognition has been a frontier area of research for the last few decades under pattern recognition. Recognition of handwritten numerals is a difficult task owing to various writing styles of individuals. A robust and efficient method for Persian/Arabic handwritten numerals recognition based on K Nearest Neighbors (K-NN) classifier is presented in this paper. The sy...

Character Recognition using Ensemble classifier

To improve the accuracy of data classification systems, several techniques using classifier fusion have been suggested. This paper proposed a model of classifier fusion for character recognition problem. The work presented here aims to tackle the disadvantages and benefit of different classifiers with varying feature sets. In particular, this approach proposes the use of statistical procedures ...

Printed and Handwritten Character &Number Recognition of Devanagari Script using SVM and KNN

Recognition of Devanagari scripts is a challenging problem. In Optical Character Recognition [OCR], a character or symbol to be recognized can be a machine-printed or handwritten character or numeral. There are several approaches that deal with the problem of recognizing numerals and characters. In this paper we have compared SVM and KNN on handwritten as well as on printed character and numerical databa...

Rejection Strategies and Confidence Measures for a k-NN Classifier in an OCR Task

In Handwritten Character Recognition, the rejection of extraneous patterns, like image noise, strokes or corrections, can improve significantly the practical usefulness of a system. In this paper, a combination of two confidence measures defined for a k-nearest neighbors classifier is proposed. Experiments are presented comparing the performance of the same system with and without the new rejec...

Journal title:
  • CoRR

Volume abs/1411.1442  Issue 

Pages  -

Publication date 2014